Automated security threat testing of web pages

ABSTRACT

A method of security testing a web application is presented. The method identifies a web application to be tested, determines potential security vulnerabilities of the web application, generates one or more security tests for testing the potential vulnerabilities, and executes the security test on the web application. The results of the security testing are then used to make the web application less vulnerable to security attacks.

CROSS REFERENCE TO RELATED APPLICATIONS

[0001] This application claims priority under 35 U.S.C. §119 (e) to provisional application serial No. 60/355,186 filed Feb. 7, 2002, and to provisional application serial No. 60/397,524 filed Jul. 22, 2002 the disclosures of which are hereby incorporated by reference.

STATEMENT REGARDING FEDERALLY SPONSORED RESEARCH

[0002] Not Applicable.

FIELD OF THE INVENTION

[0003] The present invention relates generally to the security of web pages and more specifically to a method of making web pages less vulnerable to security attacks.

BACKGROUND OF THE INVENTION

[0004] Security attacks of web applications are fairly common occurrences. A web application typically includes one or more web pages, and vulnerability on one web page can compromise the web application or even he web site running the web application. A web page may be vulnerable to many types of attacks.

[0005] Security testing of web applications is difficult and time consuming. Security of a web application is often times not specified as part of the application. Specifying security is very difficult, and implementing security measurements may not be correct. Security testing of a web application is also difficult. Furthermore, security testing of web applications requires extensive domain-specific knowledge. Also, new methods of breaking into web sites are found regularly, leading to the need to continuously track new security vulnerabilities.

[0006] One type of attack involves cookie stealing. Cookies are defined as small data files written to a user's hard drive by some web sites when the user views the website by way of a browser. These data files contain information the web site can use to track such things as passwords, lists of pages the user has visited, and the date when the user last looked at a certain page. The stealing of cookies can be used to gain access to a user's account on a web site or to provide information (such as credit card information) regarding the user. Once this information gets into the wrong hands, the attacker can access the web site and the user's information can be used by the attacker to perform a variety of tasks.

[0007] Another type of attack is known as a System Query Language (SQL) attack. Web applications typically use data from a user to construct SQL queries. In some instances a simply constructed query leads to a vulnerability wherein a user can execute arbitrary SQL queries against a database and acquire information such as passwords, social security numbers, credit cards numbers and the like.

[0008] An attack of a web site can be performed simply, by people who are generally non-technical. For example, it was recently reported that several Internet commerce sites using a particular shopping cart application were vulnerable. A user would enter the site and select items to purchase by placing them in to the application shopping cart. The user could then save the form to a local disk drive, edit the price in the form with a text editor then reload the form back into the web browser. The user would then purchase the items for the price entered by the user, instead of the actual price.

[0009] In view of the above, it would be desirable to have a method that provides testing and analysis of possible security threats to a web page so that appropriate action can be taken to make the web page less vulnerable to potential security attacks.

SUMMARY OF THE INVENTION

[0010] A method of security testing a web application is presented. The method identifies a web application to be tested, determines potential security vulnerabilities of the web application, generates a security test script for testing the potential vulnerabilities, and executes the security test script on the web application.. the results of the security tested are then used to make the web application less vulnerable to security attacks..

BRIEF DESCRIPTION OF THE DRAWINGS

[0011] The invention will be more fully understood from the following detailed description taken in conjunction with the accompanying drawings, in which:

[0012]FIG. 1 is a block diagram of a system of the present invention; and

[0013]FIG. 2 is a flow chart of the method of the present invention.

DESCRIPTION OF THE INVENTION

[0014] The present invention comprises a method to determine potential security vulnerabilities in a web page, to then provide security attacks to the web page, and to record the attack results. By way of the present invention the recorded attack results are then analyzed and used to make the web page less vulnerable to attacks.

[0015] The owner of a web site may be concerned that the web site page response is vulnerable to hackers, user error or security threats. Security threats include cookie stealing, SQL attacks, script attacks and the like. A recent analysis of web application vulnerabilities indicated that the forty-five applications surveyed generating $3.5 billion in annual revenues had on average eleven security defects each. Many of these defects could have been caught earlier in the cycle, either by having a security design focus or in performing “Security Quality Assurance,” which many companies do not perform.

[0016] Referring to FIG. 1, a block diagram showing the components of the present invention and other components used to provide the automated generation of security test scripts for a web application. A Functional Test Script Generator 10 is used to provide a Functional Test Script 20. The functional test script generator 10 may be a test product such as E-Test™ available from Empirix Inc. of Waltham, Mass. E-test™ generates one or more functional test scripts which access the web application 50 and simulate user interaction with the web application 50.

[0017] The functional test scripts 20 are provided to a Security Test Script Generator 30. The security test script generator 30 derives security test scripts 40 from the functional test scripts 20 to test potential vulnerabilities of the web application 50. A security test script is generated for each type of vulnerability. A security test script 40 may include a data bank 50 associated with the security test script 40.

[0018] The security test script 40 (and data bank 45) is then executed which test the security of the web application. The results of the execution of the security test scripts 40 are then, evaluated to determine which vulnerabilities exist in the web application 50 and corrective action can then be taken to make the web application more secure.

[0019] A flow chart of the presently disclosed method is depicted in FIG. 2. The rectangular elements are herein denoted “processing blocks” and represent computer software instructions or groups of instructions. The diamond shaped elements, are herein denoted “decision blocks,” represent computer software instructions, or groups of instructions which affect the execution of the computer software instructions represented by the processing blocks.

[0020] Alternatively, the processing and decision blocks represent steps performed by functionally equivalent circuits such as a digital signal processor circuit or an application specific integrated circuit (ASIC). The flow diagrams do not depict the syntax of any particular programming language. Rather, the flow diagrams illustrate the functional information one of ordinary skill in the art requires to fabricate circuits or to generate computer software to perform the processing required in accordance with the present invention. It should be noted that many routine program elements, such as initialization of loops and variables and the use of temporary variables are not shown. It will be appreciated by those of ordinary skill in the art that unless otherwise indicated herein, the particular sequence of steps described is illustrative only and can be varied without departing from the spirit of the invention. Thus, unless otherwise stated the steps described below are unordered meaning that, when possible, the steps can be performed in any convenient or desirable order.

[0021] The method 100 begins at step 110. This step starts the process, and includes any preliminary tasks that need to be done such as initiating any parameters, or the like. Following step 110, step 120 is executed wherein the web application to be tested is identified. An example of a web application to be tested for vulnerability to security attacks may be a banking application wherein a user connects to the banks web site through a browser, access his or her account by entering a username/password combination and can determine an account balance. The example is a very simple one, used here to help describe the present invention. It should be appreciated that web applications are typically much more complex.

[0022] The next step to be executed is step 130 wherein the web application is analyzed. This is preferably done by using a functional test script that may have been previously generated. For the current example, a functional test script which accesses the bank web site, enters a user name and password combination, then navigates through the web application to check the account balance of the account associated with the usemame/password combination may have been previously used to verify the functionality of the web application. This step may also be performed manually, by a user using a browser to navigate through the website.

[0023] Following step 130, step 140 is executed. At step 140 potential security vulnerabilities of the web application are identified. In the simplified example used here, potential vulnerabilities may include easily guessed username/password combinations, cookie manipulation, session ID generation, and the like.

[0024] At step 150, one or more security tests are generated for testing each of the potential security vulnerabilities uncovered in step 140.

[0025] The security tests are executed at step 160. In accordance with the example web application being described, a first security test tries to login to the bank website using various usemame/password combinations. The usemame/password combinations are stored in a data bank associated with the security test. A second security test is run wherein a specific test script is run some number of times and the cookies from each session are logged. A third test is run wherein a specific test script is run some number of times and the session IDs from each session are logged and analyzed for simplistic session ID generation algorithms.

[0026] At step 170 the results of the security tests execution are analyzed. The login attempts performed by the first security test are analyzed to see if any of the attempts was successful, indicating that a certain account may be compromised. The log of the cookies generated during execution of the second security test is examined for plaintext information and for non-plaintext information such as encoding commonly available on Windows and UNIX platforms. This indicates that cookie data should be handled differently. The session IDs logged during the execution of the third security test are analyzed for simplistic session ID generation algorithms. A simple session ID generation algorithm can result in manipulation of session IDs by hackers attempting to gain access to a web applications data. Following step 170, the method ends at step 180.

[0027] The present invention provides security test generation for web applications. By way of the presently described invention tests which test the security of web applications are generated. These security tests are then run to test potential security vulnerabilities of the web application, and the results are analyzed to determine which vulnerabilities exist, such that corrective action can be taken to make the web application more secure from security attacks.

[0028] The above example used only a few security tests, however there are a large number of security tests that can be run on a web application and which can provide valuable information relating to how secure a web application is.

[0029] The following description includes several potential vulnerabilities that may be associated with a web application, and further describe how to test for the vulnerability.

[0030] A first vulnerability test area for web applications involves session management. One test for session management involves the use of simplistic Session IDs. Testing to determine simplistic session IDs involves running a specific test script some number of times. Between each iteration, all cookies, cache and other state are cleared. The session IDs from each session are logged. The log of session IDs is analyzed for simplistic session ID generation algorithms. The user is able to specify the number of iterations and the test scripts used to apply this security test. If the user specifies a databank of login/password pairs, that list is iterated through and repeated as necessary. In addition, all variants of this test use the login/password from the test script repetitively. The returned pages are examined for server and login errors; session IDs should not be logged if there are login errors.

[0031] Another session management test involves non-expiration of session IDs. A specific test script is run some number of times. Between each iteration, all cookies, cache and other state are cleared. The session ID from the first iteration is logged. When the server returns a new session ID, the session ID from the first iteration is substituted for the current session ID. The user is able to specify the number of attempts to reuse the session ID and how many times to iterate over the set (and thereby get a new session ID). The user is able to specify where unique session data resides in the script. The test will have been considered failed if the pages returned in any subsequent session contain the same data as in the first session (the server accepts the session ID and does not return a session expired error).

[0032] Another session management test involves the protection of session data. A specific test script is run some number of times concurrently. Each instance of the script will have its own cookies, cache and other state. The session ID from each instance is logged. The concurrent script instances rotate session IDs and attempt to access data in the other instance's session. The user is able to specify the number of concurrent instances of a script. The user is able to specify where unique session data resides in the script. The test will have been considered to fail if the data accessed by one instance can be accessed from any other instance of the script.

[0033] Still another session management security test involves cookie manipulation. A specific test script is run some number of times. Between each iteration, all cookies, cache and other state are cleared. The cookies from each session are logged. The log of cookies is examined for plaintext information. The user is able to specify the number of iterations and the test scripts which to apply this test. The log of cookies is examined for vulnerabilities such as uuencoded and rotX fields, and other encoding commonly available on UNIX and Windo provisional application serial No. 60/355,186 filed Feb. 7, 2002 ws platforms.

[0034] A second vulnerability test area for web applications involves authentication/access control. One test for authentication/access control involves the use of default account/passwords. This test will examine a test script for any parameters which function as logins (which may have a set of common names). For each potential login found, it will generate a test script that uses an external databank that is pre-populated with account name/password pairs. The test will also parse fields found within comment fields in the HTML file to look for login password pairs. Anything found will be populated in the data bank. The user may edit the databank to specify additional accounts and passwords. The user may also specify some common algorithmic modifications of the databank entries, such as upper/lower case, reversal, and number postfixing.

[0035] Another test for authentication/access control involves the use of default scripts. This test will examine a test script for the potential location of any server-side scripts and attempt to test all those possible locations, in addition to default server install directories, for the existence of any standard scripts. For each script found, this test will attempt to exploit them using other known methods by generating derivative test scripts for each script/vulnerability pair. This test will report and log (for other tests) the results of its search. The user is able to specify the type of server that they are testing in order to narrow the potential scripts tried, and any directories which the user can specify to force a search.

[0036] Another test for authentication/access control involves the use of debug options on scripts. For each script found and logged (or just re-run that portion of previous test) and any parameters found in an input test script, the test will generate a script that submits standard post and debug queries/posts (such as ?debug=on). The user is able to augment the existing set of known default options.

[0037] A third vulnerability test area for web applications involves of input validation. One test for input validation involves buffer overflows. The test will examine a test script and locate all post and query parameters, and generate a separate test script for every parameter found. The value of the parameter is replaced with an extremely long string. The user is able to specify the length of the overflow string. The user may specify a long mode (one script per parameter) or a short mode (one script per test script web page).

[0038] A fourth vulnerability test area for web applications involves parameter tampering. One test for parameter tampering involves reordering parameters. The test script is analyzed, and for all instances where there are two or more post or query parameters, produce a test script which permutes the order of the parameters is produced, with a limit of permutations per parameter set. The user may change the permutation limit.

[0039] A second test for parameter tampering involves deleting parameters. The test script is analyzed, and for all instances where there are one or more post or query parameters, it will produce a set of test scripts which deletes one of the parameters at a time until there are none left, or the number of scripts equals the permutation limit (default 16). The user may change the permutation limit.

[0040] A third test for parameter tampering involves adding parameters. The test script is analyzed, and for all instances where there are post or query parameters, it will generate a test script where there is a new parameter inserted at the beginning of the post or query string. The user may specify the name and value of the new parameter.

[0041] A fifth vulnerability test area for web applications involves hidden parameter manipulation. A first test for hidden parameter manipulation involves changing parameter values. The test will take an input script, and for every parameter found, will generate a script that changes the value of that parameter. The test will fail if the value is found to have changed on the resultant HTML page. The user specifies data on an HTML page which is tied to the parameter.

[0042] A second test for hidden parameter involves manipulating JavaScript™ parameters. The test will look for URLs embedded in JavaScript and perform the same parameter manipulations in the URLs as standard parameter tampering.

[0043] A sixth vulnerability test for web applications involves script tampering. A first test for script tampering involves script corruption. The test script is analyzed for the presence of SQL query strings that are in hidden parameters or post actions.

[0044] Another test for script tampering includes the cross-site scripting. The test script is analyzed for the presence of form input fields that are posted to the server. It will generate a set of test scripts for every form input field that contains embedded script code and examine the return pages to look for the presence of the same script code.

[0045] Another vulnerability test area for web applications involves file/application enumeration. A first test for file/application enumeration includes directory indexing. The test will take as an input a test script file. For every directory found in the input, it will produce a page in a test script to see if directory indexing is turned on and there are no index.html files. If the script is able to find a directory auto index, the script will produce a warning for every directory that is indexable.

[0046] A second test for file/application enumeration includes access control fault testing. The test will take as an input a test script file. The script is examined for potential login places, and generates a new test script which eliminates the login page and attempts to navigate directly to subsequent pages. The test will pass if the server does not allow this navigation to occur. The user is able to specify the page at which the login to privileged information occurs, and which pages are supposed to be protected.

[0047] A third test for file/application enumeration includes mirror directories. The test will take as an input a test script file. For every directory found in the input, it will produce a page in a test script to see if hidden mirror directory names exist in that directory. There will be a set of common hidden directory names. If the script is able to find that a hidden directory exists, the script will produce a warning for every hidden directory that is found. The user is able to specify the names of hidden directories to search for.

[0048] A fourth test for file/application enumeration includes backup files. The test takes as an input a test script file. For every page found or linked to in the input script, it will produce a page in a test script to see if hidden backup copy of that page exists in that directory. There will be a set of common backup file postfixes. If the script is able to find that a hidden backup file exists, the script will produce a warning for every hidden backup file that is found. The user is able to specify the names of hidden backup file postfixes.

[0049] A fifth test for file/application enumeration includes common files. The test takes as an input a test script file. For every directory found in the input, it will produce a page in a test script to see if hidden files corresponding to files automatically generated by common applications exist in that directory. There will be a set of common application file names. If the script is able to find that a hidden file exists, the script will produce a warning for every hidden file that is found. The user is able to specify the names of hidden files to search for.

[0050] A sixth test for file/application enumeration includes unprotected web traversal. This test will try to navigate to directories outside of the web file structure.

[0051] A method of providing security test script generation has been described. The method analyzes a web application for potential vulnerabilities. Once the potential vulnerabilities are identified, the present invention generates one or more security tests to test for all the potential vulnerabilities identified. The results of security test script execution are then analyzed to determine if potential vulnerability in fact exists, such that the web application can be modified to remove the security vulnerability.

[0052] Having described preferred embodiments of the invention it will now become apparent to those of ordinary skill in the art that other embodiments incorporating these concepts may be used. Additionally, the software included as part of the invention may be embodied in a computer program product that includes a computer useable medium. For example, such a computer usable medium can include a readable memory device, such as a hard drive device, a CD-ROM, a DVD-ROM, or a computer diskette, having computer readable program code segments stored thereon. The computer readable medium can also include a communications link, either optical, wired, or wireless, having program code segments carried thereon as digital or analog signals. Accordingly, it is submitted that that the invention should not be limited to the described embodiments but rather should be limited only by the spirit and scope of the appended claims. All publications and references cited herein are expressly incorporated herein by reference in their entirety. 

What is claimed is:
 1. A method of security testing a web application comprising: identifying a web application to be tested; identifying potential security vulnerabilities of the web application; generating a security test for testing at least one of said potential vulnerabilities; executing said security test on said web application; and analyzing results of said executing said security test.
 2. The method of claim 1 further comprising using the results of said executing said security test to improve security of said web application.
 3. The method of claim 1 wherein said identifying potential security vulnerabilities includes analyzing a path through the web application.
 4. The method of claim 1 wherein said web application comprises one or more web pages.
 5. The method of claim 1 wherein said potential security vulnerabilities include at least one vulnerability selected from the group comprising session management vulnerability, authentication/access control vulnerability, input validation vulnerability, parameter tampering vulnerability, hidden parameter manipulation vulnerability, script tampering vulnerability, and file/application enumeration vulnerability.
 6. The method of claim 5 wherein said session management vulnerability comprises at least one of simple session ID generation, non-expiration of session IDs, protection of session data, and cookie manipulation.
 7. The method of claim 5 wherein said authentication/access control vulnerability comprises at least one of default accounts/passwords, default scripts, and debug options on scripts.
 8. The method of claim 5 wherein said input validation vulnerability comprises buffer overflow.
 9. The method of claim 5 wherein said parameter tampering vulnerability comprises at least one of reordering parameters, deleting parameters and adding parameters.
 10. The method of claim 5 wherein said hidden parameter manipulation vulnerability comprises at least one of changing parameters, manipulating parameters, and manipulating Java Script parameters.
 11. The method of claim 5 wherein said script tampering vulnerability comprises at least one of script corruption and cross-site scripting.
 12. The method of claim 5 wherein said file/application enumeration vulnerability comprises at least one of directory indexing, access/control faults, mirror directories, backup files, common files and web travel.
 13. A computer program product comprising a computer usable medium having computer readable code thereon for security testing of a web application comprising: instructions for identifying a web application to be tested; instructions for identifying potential security vulnerabilities of the web application; instructions for generating a security test for testing at least one of said potential vulnerabilities; instructions for executing said security test on said web application; and instructions for analyzing results of said executing said security test.
 14. The computer program product of claim 13 further comprising instructions for using the results of said executing said security test to improve security of said web application.
 15. The computer program product of claim 13 wherein said instructions for identifying potential security vulnerabilities includes instructions for analyzing a path through the web application.
 16. The computer program product of claim 13 wherein said web application comprises one or more web pages.
 17. The computer program product of claim 13 wherein said instructions for identifying potential security vulnerabilities include instructions for identifying at least one vulnerability selected from the group comprising session management vulnerability, authentication/access control vulnerability, input validation vulnerability, parameter tampering vulnerability, hidden parameter manipulation vulnerability, script tampering vulnerability, and file/application enumeration vulnerability.
 18. The computer program product of claim 17 wherein said session management vulnerability comprises at least one of simple session ID generation, non-expiration of session IDs, protection of session data, and cookie manipulation.
 19. The computer program product of claim 17 wherein said authentication/access control vulnerability comprises at least one of default accounts/passwords, default scripts, and debug options on scripts.
 20. The computer program product of claim 17 wherein said input validation vulnerability comprises buffer overflow.
 21. The computer program product of claim 13 wherein said parameter tampering vulnerability comprises at least one of reordering parameters, deleting parameters and adding parameters.
 22. The computer program product of claim 13 wherein said hidden parameter manipulation vulnerability comprises at least one of changing parameters, manipulating parameters, and manipulating Java Script parameters.
 23. The computer program product of claim 13 wherein said script tampering vulnerability comprises at least one of script corruption and cross-site scripting.
 24. The computer program product of claim 13 wherein said file/application enumeration vulnerability comprises at least one of directory indexing, access/control faults, mirror directories, backup files, common files and web travel. 